Abstract

A concentration of measure result is proved for the number of isolated vertices $Y$ in the Erdős-Rényi random graph model on $n$ vertices with edge probability $p$. When $\mu$ and $\sigma^2$ denote the mean and variance of $Y$ respectively, $P((Y-\mu)/\sigma \ge t)$ admits a bound of the form $e^{-kt^2}$ for some constant positive $k$ under the assumption $p \in (0,1)$ and $np \to c \in (0,\infty)$ as $n \to \infty$. The left tail inequality

\[
P\left(\frac{Y-\mu}{\sigma} \le -t\right) \le \exp\left(-\frac{t^2\sigma^2}{4\mu}\right)
\]

holds for all $n \in \{2,3,\ldots\}$, $p \in (0,1)$ and $t > 0$. The results are shown by coupling $Y$ to a random variable $Y^s$ having the $Y$-size biased distribution, that is, the distribution characterized by $E[Yf(Y)] = \mu E[f(Y^s)]$ for all functions $f$ for which these expectations exist.
1 Introduction and main result

For some $n \in \{1,2,\ldots\}$ and $p \in (0,1)$ let $K$ be the Erdős-Rényi random graph on the vertices $\mathcal{V} = \{1,2,\ldots,n\}$ with edge success probability $p$, that is, with edge indicators $X_{uv} = 1(uv \text{ is an edge in } K)$, $u, v \in \mathcal{V}$, independent random variables with the Bernoulli($p$) distribution for all $u \ne v$. We set $X_{vv} = 0$ for all $v \in \mathcal{V}$. Recall that the degree of a vertex $v \in \mathcal{V}$, denoted by $d(v)$, is the number of edges incident on $v$. Hence,

\[
d(v) = \sum_{w \in \mathcal{V}} X_{vw}.
\]

Many authors have studied the distribution of

\[
Y = \sum_{v \in \mathcal{V}} 1(d(v) = d) \tag{1}
\]

counting the number of vertices $v$ of $K$ having some fixed degree $d$. We derive upper bounds, for fixed $n$ and $p$, on the tail probabilities of the number of isolated vertices of $K$, that is, for $Y$ in (1) for the case $d = 0$, which counts the number of vertices having no incident edges.

For $d$ in general, and $p$ depending on $n$, previously in Karoński and Ruciński (1987), the asymptotic normality of $Y$ was shown when $n^{(d+1)/d}p \to \infty$ and $np \to 0$, or $np \to \infty$ and $np - \log n - d \log\log n \to -\infty$; see also Palka (1984) and Bollobás (1985). For the case $d = 0$ of isolated vertices, Barbour (1982) and Barbour et al. (1989) show that $Y$ is asymptotically normal if and only if $n^2p \to \infty$ and $np - \log n \to -\infty$.
Here we study the distribution of $Y$ using a size bias coupling that was used in Goldstein and Rinott (1996) to study the rate of convergence to the multivariate normal distribution for a vector whose components count the number of vertices of some fixed degrees. In Kordecki (1990), the mean $\mu$ and variance $\sigma^2$ of $Y$ for the particular case $d = 0$ are computed as

\[
\mu = n(1-p)^{n-1} \quad \text{and} \quad \sigma^2 = n(1-p)^{n-1}\left(1 + np(1-p)^{n-2} - (1-p)^{n-2}\right) \quad \text{for } n \ge 2. \tag{2}
\]
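As a sanity check on (2), the following minimal sketch compares the closed forms for $\mu$ and $\sigma^2$ with a brute-force enumeration over all graphs on a small vertex set. The function names and the choice $n = 5$, $p = 0.3$ are illustrative assumptions, not part of the original text.

```python
from itertools import combinations, product

def mean_var_isolated(n, p):
    """Closed-form mean and variance of the number of isolated vertices, as in (2)."""
    q = 1.0 - p
    mu = n * q ** (n - 1)
    var = mu * (1.0 + n * p * q ** (n - 2) - q ** (n - 2))
    return mu, var

def moments_by_enumeration(n, p):
    """Mean and variance obtained by summing over all 2^(n choose 2) graphs (small n only)."""
    edges = list(combinations(range(n), 2))
    m1 = m2 = 0.0
    for present in product((0, 1), repeat=len(edges)):
        degree = [0] * n
        prob = 1.0
        for (u, v), x in zip(edges, present):
            prob *= p if x else 1.0 - p
            if x:
                degree[u] += 1
                degree[v] += 1
        y = degree.count(0)  # number of isolated vertices in this graph
        m1 += prob * y
        m2 += prob * y * y
    return m1, m2 - m1 * m1

mu, var = mean_var_isolated(5, 0.3)
mu_bf, var_bf = moments_by_enumeration(5, 0.3)
print(mu, mu_bf)
print(var, var_bf)
```

The enumeration is exponential in the number of possible edges, so it is only meant for checking the formulas on very small graphs.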



In the same paper, Kolmogorov distance bounds to the normal of order $O(\mathrm{Var}(Y)^{-1/2})$ were obtained. O'Connell (1998) showed that an asymptotic large deviation principle holds for $Y$. Raič (2007) obtains nonuniform large deviation bounds for mean zero, variance one random variables in some generality, and applies his results to the case of counting the number of isolated vertices with $W = (Y-\mu)/\sigma$, yielding the bound

\[
\frac{P(W \ge t)}{1 - \Phi(t)} \le e^{t^3\beta(t)/6}\left(1 + Q(t)\beta(t)\right) \quad \text{for all } t \ge 0, \tag{3}
\]

where $\Phi(t)$ denotes the distribution function of a standard normal variate, $Q(t)$ is an explicit function of $t$ with numerical constants, and $\beta(t)$ is the product of a factor containing $13 + 43np + 27(np)^2$ and an exponential factor containing $2np(e^{t/\sigma} - 1)$; see Raič (2007) for the exact expressions.



Still from Raič (2007), when $np \to c$ as $n \to \infty$, (3) holds for all $n$ sufficiently large with

\[
\beta(t) = \frac{C_1}{\sqrt{n}} \exp\left(\frac{C_2 t}{\sqrt{n}} + C_3\left(e^{C_4 t/\sqrt{n}} - 1\right)\right) \tag{4}
\]

for some unspecified constants $C_1, C_2, C_3$ and $C_4$ depending only on $c$. For $t$ of order $\sqrt{n}$, for instance, the function $\beta(t)$ will be of order $1/\sqrt{n}$ as $n \to \infty$, allowing an asymptotic approximation of the deviation probability $P(W \ge t)$ by the normal, to within some factors.

In Theorem 1.1 we supply a bound that holds for all $n$, and that also gives somewhat more explicit information on the rate of tail decay. In particular, we see from (6) that the standardized variable $W$ has a left tail that is bounded above by $\exp(-t^2\sigma^2/(4\mu))$. Moreover, the right tail also exhibits similar bounds over some parameter regions, with a worst case bound there of order $\exp(-\rho t)$ by (7) for some $\rho > 0$, but see also Corollary 1.1 for a further improvement in the regime where $np$ converges to a nonzero constant.

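Theorem 1.1. For $n \in \{2,3,\ldots\}$ and $p \in (0,1)$ let $K$ denote the random graph on $n$ vertices where each edge is present with probability $p$, independently of all other edges, and let $Y$ denote the number of isolated vertices in $K$, having mean $\mu$ and variance $\sigma^2$, as given in (2). Let $M(\theta) = E\exp(\theta(Y-\mu)/\sigma)$ be the moment generating function of the standardized $Y$ variable. Then, letting

\[
\gamma_s = e^{s}(pe^{s} + 1 - p)^{n-2}(npe^{s} + 1 - p) + (n-1)p + 1 \quad \text{and} \quad H(\theta) = \frac{\mu}{2\sigma^2}\int_0^\theta s\,\gamma_{s/\sigma}\,ds,
\]

we have $M(\theta) \le \exp H(\theta)$ for all $\theta \ge 0$, and for all $t > 0$,

\[
P\left(\frac{Y-\mu}{\sigma} \ge t\right) \le \inf_{\theta \ge 0} \exp(-\theta t + H(\theta)). \tag{5}
\]

For all $\theta < 0$ we have $M(\theta) \le \exp(\mu\theta^2/\sigma^2)$, and for all $t > 0$,

\[
P\left(\frac{Y-\mu}{\sigma} \le -t\right) \le \exp\left(-\frac{t^2\sigma^2}{4\mu}\right). \tag{6}
\]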



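Though integration shows that we may explicitly write

\[
H(\theta) = \frac{\mu}{4np\sigma^2}\left( np((n-1)p + 1)\theta^2 + 2\sigma^2 - 2\sigma\left(pe^{\theta/\sigma}(\sigma - n\theta) + \sigma(1-p)\right)\left(p(e^{\theta/\sigma} - 1) + 1\right)^{n-1} \right),
\]

the integral formula for $H(\theta)$ in the theorem appears simpler to handle.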
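To illustrate how the two expressions for $H(\theta)$ relate, here is a minimal sketch that evaluates the integral formula of Theorem 1.1 by composite Simpson quadrature and compares it with the closed form $\frac{\mu}{4np\sigma^2}(\cdots)$ obtained by integrating by parts. The function names, the quadrature rule and the parameter values are assumptions made for illustration.

```python
import math

def mean_var_isolated(n, p):
    """Mean and variance of the number of isolated vertices, as in (2)."""
    q = 1.0 - p
    mu = n * q ** (n - 1)
    return mu, mu * (1.0 + n * p * q ** (n - 2) - q ** (n - 2))

def gamma(s, n, p):
    """gamma_s from Theorem 1.1."""
    es = math.exp(s)
    return es * (p * es + 1.0 - p) ** (n - 2) * (n * p * es + 1.0 - p) + (n - 1) * p + 1.0

def H_numeric(theta, n, p, steps=2000):
    """H(theta) = mu/(2 sigma^2) * int_0^theta s*gamma_{s/sigma} ds via Simpson (steps even)."""
    mu, var = mean_var_isolated(n, p)
    sigma = math.sqrt(var)
    h = theta / steps
    total = 0.0
    for i in range(steps + 1):
        s = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * s * gamma(s / sigma, n, p)
    return mu / (2.0 * var) * total * h / 3.0

def H_closed(theta, n, p):
    """Closed form of H(theta) after carrying out the integration."""
    mu, var = mean_var_isolated(n, p)
    sigma = math.sqrt(var)
    e = math.exp(theta / sigma)
    bracket = (n * p * ((n - 1) * p + 1.0) * theta ** 2 + 2.0 * var
               - 2.0 * sigma * (p * e * (sigma - n * theta) + sigma * (1.0 - p))
               * (p * (e - 1.0) + 1.0) ** (n - 1))
    return mu / (4.0 * n * p * var) * bracket

for theta in (0.5, 1.5, 3.0):
    print(theta, H_numeric(theta, 50, 0.04), H_closed(theta, 50, 0.04))
```

In practice the integral form is convenient for bounding arguments, while the closed form is cheaper to evaluate repeatedly, for example inside a numerical minimization of $-\theta t + H(\theta)$.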

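Useful bounds for the minimization in (5) may be obtained by restricting to $\theta \in [0, \theta_0]$ for some $\theta_0$. In this case, as $\gamma_{s/\sigma}$ is an increasing function of $s$, we have

\[
H(\theta) \le \frac{\mu\gamma_{\theta_0/\sigma}}{4\sigma^2}\theta^2 \quad \text{for } \theta \in [0, \theta_0].
\]

The quadratic $-\theta t + \mu\gamma_{\theta_0/\sigma}\theta^2/(4\sigma^2)$ in $\theta$ is minimized at $\theta = 2t\sigma^2/(\mu\gamma_{\theta_0/\sigma})$. When this value falls in $[0, \theta_0]$ we obtain the first bound in (7), and setting $\theta = \theta_0$ yields the second, thus,

\[
P\left(\frac{Y-\mu}{\sigma} \ge t\right) \le
\begin{cases}
\exp\left(-\dfrac{t^2\sigma^2}{\mu\gamma_{\theta_0/\sigma}}\right) & \text{for } t \in [0, \theta_0\mu\gamma_{\theta_0/\sigma}/(2\sigma^2)], \\[2ex]
\exp\left(-\theta_0 t + \dfrac{\mu\gamma_{\theta_0/\sigma}\theta_0^2}{4\sigma^2}\right) & \text{for } t \in (\theta_0\mu\gamma_{\theta_0/\sigma}/(2\sigma^2), \infty).
\end{cases} \tag{7}
\]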
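The two-regime bound (7) can be sketched in code as follows. This is a minimal illustration under assumed parameter values; `right_tail_bound` and the cutoff `theta0` are hypothetical names, and the grid search at the end only serves to confirm that (7) is the minimum of the quadratic exponent over $[0, \theta_0]$.

```python
import math

def mean_var_isolated(n, p):
    """Mean and variance of the number of isolated vertices, as in (2)."""
    q = 1.0 - p
    mu = n * q ** (n - 1)
    return mu, mu * (1.0 + n * p * q ** (n - 2) - q ** (n - 2))

def gamma(s, n, p):
    """gamma_s from Theorem 1.1."""
    es = math.exp(s)
    return es * (p * es + 1.0 - p) ** (n - 2) * (n * p * es + 1.0 - p) + (n - 1) * p + 1.0

def right_tail_bound(t, theta0, n, p):
    """Bound (7) on P((Y - mu)/sigma >= t) for a chosen cutoff theta0 > 0."""
    mu, var = mean_var_isolated(n, p)
    g = gamma(theta0 / math.sqrt(var), n, p)
    threshold = theta0 * mu * g / (2.0 * var)
    if t <= threshold:
        # unconstrained minimizer 2 t sigma^2 / (mu g) lies in [0, theta0]
        return math.exp(-t * t * var / (mu * g))
    # otherwise the quadratic is decreasing on [0, theta0]; evaluate at theta0
    return math.exp(-theta0 * t + mu * g * theta0 ** 2 / (4.0 * var))

def grid_minimum(t, theta0, n, p, points=10000):
    """Minimum over a grid of theta in [0, theta0] of exp(-theta t + mu g theta^2/(4 sigma^2))."""
    mu, var = mean_var_isolated(n, p)
    g = gamma(theta0 / math.sqrt(var), n, p)
    thetas = [theta0 * i / points for i in range(points + 1)]
    return min(math.exp(-th * t + mu * g * th * th / (4.0 * var)) for th in thetas)

for t in (0.5, 2.0, 10.0):
    print(t, right_tail_bound(t, 1.0, 100, 0.02), grid_minimum(t, 1.0, 100, 0.02))
```

Larger values of `theta0` extend the Gaussian-type regime to larger $t$ but increase $\gamma_{\theta_0/\sigma}$, which weakens the constant in the exponent; the choice is a trade-off.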



Inequality (7) and the boundedness of $Y$ yield the following useful corollary.



Corollary 1.1. For all $c \in (0,\infty)$ there exists a positive constant $k$ depending only on $c$ such that when $p \in (0,1)$ and $np \to c \in (0,\infty)$ as $n \to \infty$,

\[
P\left(\frac{Y-\mu}{\sigma} \ge t\right) \le \exp(-kt^2)
\]

for all $t \ge 0$ and all $n \ge 2$.



Proof. Since $Y$ can be no more than $n$, and $\sigma^2$ increases at rate $n$ when $np \to c$, there exists a positive constant $a_0$ such that

\[
\frac{Y-\mu}{\sigma} \le \frac{n}{\sigma} \le a_0\sqrt{n}.
\]

Hence $P((Y-\mu)/\sigma \ge t) = 0$ for all $t > a_0\sqrt{n}$.

For any given $n$ let $\theta_n = a_0\sqrt{n}\sigma^2/\mu$. Then, as $\gamma_s \ge 2$ for all $s \ge 0$, we have $\theta_n\mu\gamma_{\theta_n/\sigma}/(2\sigma^2) \ge a_0\sqrt{n}$, so the first bound in (7) applies for all $t \le a_0\sqrt{n}$. Note that $\theta_n/\sigma = a_0\sqrt{n}\sigma/\mu$ converges to a positive constant, implying the convergence of $\gamma_{\theta_n/\sigma}$, and hence that of $\sigma^2/(\mu\gamma_{\theta_n/\sigma})$, also to a positive constant. Since $\sigma^2/(\mu\gamma_{\theta_n/\sigma})$ is positive for all $n \ge 2$, we see that the claim of the corollary holds for all $k$ in the nonempty interval $(0, \inf_n \sigma^2/(\mu\gamma_{\theta_n/\sigma}))$. □

In the asymptotic of Corollary 1.1 for, say, $t = a\sqrt{n}$, the function $\beta(t)$ of (4) behaves like $C/\sqrt{n}$, so the bound (3) also gives useful information for some range of positive values of $a$ up to some upper limit. However, as $\exp(t^3\beta(t)/6)$ behaves like $\exp(Cna^3/6)$, when multiplied by $1 - \Phi(t)$, of exponential order $\exp(-a^2n/2)$, the product tends to infinity for all sufficiently large $a$, so the bound in (3) may explode before the right tail of $W$ vanishes.

The main tool used in proving Theorem 1.1 is size bias coupling, that is, the construction of $Y$ and $Y^s$ on the same space where $Y^s$ has the $Y$-size biased distribution characterized by

\[
E[Yf(Y)] = \mu E[f(Y^s)] \tag{8}
\]

for all $f$ for which the expectations above exist. In Ghosh and Goldstein (2011a) and Ghosh and Goldstein (2011b), size bias couplings were used to prove concentration of measure inequalities when $|Y^s - Y|$ can be almost surely bounded by a constant independent of the problem size. Here, in contrast, we apply the coupling for the number of isolated vertices of $K$ from Goldstein and Rinott (1996), which violates the boundedness condition. Unlike the theorem used in Ghosh and Goldstein (2011a) and Ghosh and Goldstein (2011b), which can be applied to a wide variety of situations under a bounded coupling assumption, it seems that cases where the coupling is unbounded, such as the one we consider here, need application specific treatment, and cannot be handled by one single general result.

Having its roots in the work of Baldi et al. (1989), a general prescription for constructing a variable with the size bias distribution of a sum of nonnegative variables is given in Goldstein and Rinott (1996). Helped by the fact that size biasing a nontrivial indicator random variable simply sets its value to one, specializing to nontrivial exchangeable indicators yields the following simplification, as in Lemma 3.3 of Goldstein and Penrose (2010).



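Proposition 1.1. Suppose $Y = \sum_{v \in \mathcal{V}} X_v$ is a finite sum of nontrivial exchangeable Bernoulli variables $\{X_v, v \in \mathcal{V}\}$, and that for $w \in \mathcal{V}$ the variables $\{X_v^w, v \in \mathcal{V}\}$ have joint distribution

\[
\mathcal{L}(X_v^w, v \in \mathcal{V}) = \mathcal{L}(X_v, v \in \mathcal{V} \mid X_w = 1).
\]

Then

\[
Y^w = \sum_{v \in \mathcal{V}} X_v^w
\]

has the $Y$-size biased distribution $Y^s$, as does the mixture $Y^V$ when $V$ is a random index with values in $\mathcal{V}$, chosen independent of all other variables.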

Construction of the variable $Y^s$ is not enough for our purposes; one must couple $Y^s$ to $Y$. However, Proposition 1.1 suggests a natural coupling. Given the exchangeable indicators $\{X_v, v \in \mathcal{V}\}$ that sum to $Y$, choose a summand uniformly and independently. If the summand value is already one, set $Y^s = Y$. Otherwise, set this variable to one, and 'adjust' the remaining variables to have their conditional distribution given that this variable takes on the value one. By Proposition 1.1 the sum of these new variables has the $Y$-size biased distribution.
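A minimal sketch of this recipe in the simplest exchangeable case may help. It assumes the indicators are independent Bernoulli($p$) variables (an assumption made here for illustration, not the graph setting of the paper), so that conditioning on one summand being one leaves the others unchanged and the construction gives $Y^w = 1 + \text{Binomial}(n-1, p)$. The code compares this with the size biased law computed directly from $P(Y^s = k) = kP(Y = k)/\mu$, which follows from the characterization $E[Yf(Y)] = \mu E[f(Y^s)]$ by taking $f$ to be an indicator.

```python
from math import comb

def binomial_pmf(n, p):
    """pmf of a Binomial(n, p) variable as a list indexed by k = 0..n."""
    return [comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)]

def size_bias_pmf(pmf):
    """Size biased pmf computed directly: P(Y^s = k) = k P(Y = k) / E[Y]."""
    mu = sum(k * w for k, w in enumerate(pmf))
    return [k * w / mu for k, w in enumerate(pmf)]

def constructed_pmf(n, p):
    """pmf of Y^w = 1 + (sum of the other n - 1 independent indicators)."""
    return [0.0] + binomial_pmf(n - 1, p)

n, p = 12, 0.3
direct = size_bias_pmf(binomial_pmf(n, p))
constructed = constructed_pmf(n, p)
max_gap = max(abs(a - b) for a, b in zip(direct, constructed))
print(max_gap)
```

For dependent indicators, such as the isolated vertex indicators studied here, the 'adjust' step is nontrivial and has to be tailored to the dependence structure.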



2 Proof of Theorem 1.1 

For any graph with vertex set $\mathcal{V}$, for $v \in \mathcal{V}$ we let $N(v)$ denote the set of neighbors of $v$,

\[
N(v) = \{w \in \mathcal{V} : X_{vw} = 1\},
\]

where $X_{vw}$ is the indicator that there exists an edge connecting vertices $v$ and $w$. We now present the proof of Theorem 1.1.

Proof. Following Proposition 1.1 we first construct a coupling of $Y^s$, having the $Y$-size bias distribution, to $Y$. Let $K$ be given, and let $Y$ be the number of isolated vertices in $K$. From (1) with $d = 0$ we see that $Y$ is the sum of exchangeable indicators. Let $V$ be uniformly chosen from $\mathcal{V}$, independent of the remaining variables. If $V$ is already isolated, do nothing and set $K^s = K$. Otherwise, let $K^s$ be the graph obtained by deleting all the edges connected to $V$ in $K$. By Proposition 1.1 the variable $Y^s$ counting the number of isolated vertices of $K^s$ has the $Y$-size biased distribution.

Since all edges incident to the chosen $V$ are removed in order to form $K^s$, any neighbor of $V$ which had degree one thus becomes isolated, and $V$ also becomes isolated if it was not so earlier. As $1(d(w) = 0)$ is unchanged for all $w \notin \{V\} \cup N(V)$, we have

\[
Y^s - Y = d_1(V) + 1(d(V) \ne 0) \quad \text{where for any } v \in \mathcal{V} \text{ we let } d_1(v) = \sum_{w \in N(v)} 1(d(w) = 1). \tag{9}
\]

In particular the coupling is monotone, that is, $Y^s \ge Y$. Further, since $d_1(V) \le d(V)$, by (9) we have

\[
0 \le Y^s - Y \le d(V) + 1. \tag{10}
\]
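The coupling can be checked exhaustively on a small graph. The sketch below, with hypothetical function names and an assumed choice of $n = 4$, enumerates every graph $K$ and every choice of the vertex $V$, deletes the edges at $V$ to form $K^s$, and verifies the identity (9), the bounds (10), and the size bias characterization $E[Yf(Y)] = \mu E[f(Y^s)]$ for a test function $f$.

```python
from itertools import combinations, product

def isolated_count(adj):
    """Number of vertices with no incident edge in the adjacency matrix adj."""
    return sum(1 for row in adj if not any(row))

def check_coupling(n, p, f):
    """Return (E[Y f(Y)], mu * E[f(Y^s)], all_checks_passed) by full enumeration."""
    edges = list(combinations(range(n), 2))
    lhs = mean_f_ys = mu = 0.0
    ok = True
    for present in product((0, 1), repeat=len(edges)):
        prob = 1.0
        adj = [[0] * n for _ in range(n)]
        for (u, v), x in zip(edges, present):
            prob *= p if x else 1.0 - p
            adj[u][v] = adj[v][u] = x
        deg = [sum(row) for row in adj]
        y = isolated_count(adj)
        lhs += prob * y * f(y)
        mu += prob * y
        for V in range(n):  # V uniform on the vertex set
            adj_s = [row[:] for row in adj]
            for w in range(n):  # form K^s by deleting every edge at V
                adj_s[V][w] = adj_s[w][V] = 0
            ys = isolated_count(adj_s)
            d1 = sum(1 for w in range(n) if adj[V][w] and deg[w] == 1)
            ok = ok and (ys - y == d1 + (1 if deg[V] != 0 else 0))  # identity (9)
            ok = ok and (0 <= ys - y <= deg[V] + 1)                 # bounds (10)
            mean_f_ys += prob / n * f(ys)
    return lhs, mu * mean_f_ys, ok

lhs, rhs, ok = check_coupling(4, 0.35, lambda y: y * y + 1.0)
print(lhs, rhs, ok)
```

Because $Y^s - Y$ can be as large as $d(V) + 1$, the increment is not bounded independently of $n$, which is the unbounded-coupling feature the proof has to handle.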
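Now note that, for real $x \ne y$, the convexity of the exponential function implies

\[
\frac{e^y - e^x}{y - x} = \int_0^1 e^{ty + (1-t)x}\,dt \le \int_0^1 \left(te^y + (1-t)e^x\right)dt = \frac{e^y + e^x}{2},
\]

and therefore, for all real $x, y$,

\[
|e^x - e^y| \le |x - y|\,\frac{e^x + e^y}{2}. \tag{11}
\]

Let $\theta \ge 0$. Using (11) and (10), we have

\[
E\left(e^{\theta Y^s} - e^{\theta Y}\right) \le \frac{\theta}{2}E\left((Y^s - Y)(e^{\theta Y^s} + e^{\theta Y})\right) = \frac{\theta}{2}E\left(e^{\theta Y}(Y^s - Y)(e^{\theta(Y^s - Y)} + 1)\right) \le \frac{\theta}{2}E\left(e^{\theta Y}(d(V) + 1)(e^{\theta(d(V)+1)} + 1)\right). \tag{12}
\]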

Clearly the number of isolated vertices $Y$ is a nonincreasing function of the edge indicators $X_{vw}$, while $d(V) + 1$ is a nondecreasing function of these same indicators. Hence $Y$ and $d(V) + 1$ have negative correlations, that is, by the inequality of Harris (1960),

\[
E[f(Y)g(d(V) + 1)] \le E[f(Y)]\,E[g(d(V) + 1)] \tag{13}
\]

for any two nondecreasing real functions $f$ and $g$. In particular, when $f(x) = e^{\theta x}$ and $g(x) = x(e^{\theta x} + 1)$ with $x \in [0,\infty)$, by (12) and (13) we obtain

\[
E\left(e^{\theta Y^s} - e^{\theta Y}\right) \le \frac{\theta}{2}E(e^{\theta Y})\,E\left((d(V) + 1)(e^{\theta(d(V)+1)} + 1)\right) = \frac{\theta}{2}E(e^{\theta Y})\left(e^{\theta}E\left(d(V)e^{\theta d(V)} + e^{\theta d(V)}\right) + E(d(V)) + 1\right). \tag{14}
\]



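To handle the terms in (14), note that for any vertex $v$ the degree $d(v)$ has the Binomial($n-1$, $p$) distribution, and in particular

\[
E(d(v)) = (n-1)p \quad \text{and} \quad E(e^{\theta d(v)}) = \alpha_\theta \quad \text{where } \alpha_\theta = (pe^{\theta} + 1 - p)^{n-1}.
\]

Hence, as $V$ is chosen uniformly over the vertices $v$ of $K$,

\[
E(d(V)) = (n-1)p \quad \text{and} \quad E(e^{\theta d(V)}) = \alpha_\theta, \tag{15}
\]

and now differentiation under the second expectation above, allowed since $d(V)$ is bounded, yields

\[
E(d(V)e^{\theta d(V)}) = \beta_\theta \quad \text{where } \beta_\theta = (n-1)pe^{\theta}(pe^{\theta} + 1 - p)^{n-2}. \tag{16}
\]

Substituting (15) and (16) into (14) yields, for all $\theta \ge 0$,

\[
E\left(e^{\theta Y^s} - e^{\theta Y}\right) \le \frac{\theta\gamma_\theta}{2}E(e^{\theta Y}) \quad \text{where } \gamma_\theta = e^{\theta}(\beta_\theta + \alpha_\theta) + (n-1)p + 1. \tag{17}
\]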
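The binomial moment identities behind (15) to (17) are easy to confirm numerically. The following minimal sketch (function names and parameter values are illustrative assumptions) compares the closed forms for $\alpha_\theta$, $\beta_\theta$ and $\gamma_\theta$ with direct sums over the Binomial($n-1$, $p$) distribution of the degree, and also checks that $\gamma_\theta$ agrees with the factored expression for $\gamma_s$ in Theorem 1.1.

```python
import math

def alpha(theta, n, p):
    """E exp(theta d(V)) for d(V) ~ Binomial(n - 1, p)."""
    return (p * math.exp(theta) + 1.0 - p) ** (n - 1)

def beta(theta, n, p):
    """E d(V) exp(theta d(V)), the theta-derivative of alpha."""
    return (n - 1) * p * math.exp(theta) * (p * math.exp(theta) + 1.0 - p) ** (n - 2)

def gamma(theta, n, p):
    """gamma_theta as defined in (17)."""
    return math.exp(theta) * (beta(theta, n, p) + alpha(theta, n, p)) + (n - 1) * p + 1.0

def gamma_factored(s, n, p):
    """gamma_s in the factored form used in Theorem 1.1."""
    es = math.exp(s)
    return es * (p * es + 1.0 - p) ** (n - 2) * (n * p * es + 1.0 - p) + (n - 1) * p + 1.0

def binomial_expectation(g, m, p):
    """E g(D) for D ~ Binomial(m, p), by direct summation."""
    return sum(math.comb(m, k) * p ** k * (1 - p) ** (m - k) * g(k) for k in range(m + 1))

n, p, theta = 30, 0.1, 0.7
alpha_direct = binomial_expectation(lambda k: math.exp(theta * k), n - 1, p)
beta_direct = binomial_expectation(lambda k: k * math.exp(theta * k), n - 1, p)
gamma_direct = binomial_expectation(
    lambda k: (k + 1) * (math.exp(theta * (k + 1)) + 1.0), n - 1, p)
print(alpha(theta, n, p), alpha_direct)
print(beta(theta, n, p), beta_direct)
print(gamma(theta, n, p), gamma_factored(theta, n, p), gamma_direct)
```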



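Letting $m(\theta) = E(e^{\theta Y})$, using that $Y$ is bounded to differentiate under the expectation, along with (8) and (17), we obtain

\[
m'(\theta) = E(Ye^{\theta Y}) = \mu E(e^{\theta Y^s}) \le \mu\left(1 + \frac{\theta\gamma_\theta}{2}\right)m(\theta). \tag{18}
\]

Standardizing $Y$, we set

\[
M(\theta) = E(\exp(\theta(Y-\mu)/\sigma)) = e^{-\theta\mu/\sigma}m(\theta/\sigma), \tag{19}
\]

and now by differentiating and applying (18), we obtain

\[
\begin{aligned}
M'(\theta) &= \frac{1}{\sigma}e^{-\theta\mu/\sigma}m'(\theta/\sigma) - \frac{\mu}{\sigma}e^{-\theta\mu/\sigma}m(\theta/\sigma) \\
&\le \frac{\mu}{\sigma}e^{-\theta\mu/\sigma}\left(1 + \frac{\theta\gamma_{\theta/\sigma}}{2\sigma}\right)m(\theta/\sigma) - \frac{\mu}{\sigma}e^{-\theta\mu/\sigma}m(\theta/\sigma) \\
&= \frac{\mu}{2\sigma^2}\theta\gamma_{\theta/\sigma}M(\theta).
\end{aligned}
\]

Since $M(0) = 1$, integrating $M'(s)/M(s)$ over $[0,\theta]$ yields the bound

\[
\log(M(\theta)) \le H(\theta), \quad \text{or that} \quad M(\theta) \le \exp(H(\theta)) \quad \text{where } H(\theta) = \frac{\mu}{2\sigma^2}\int_0^\theta s\,\gamma_{s/\sigma}\,ds,
\]

proving the claim on $M(\theta)$ for $\theta \ge 0$. Moreover, for $\theta$ nonnegative,

\[
P\left(\frac{Y-\mu}{\sigma} \ge t\right) \le P\left(\exp\left(\frac{\theta(Y-\mu)}{\sigma}\right) \ge e^{\theta t}\right) \le e^{-\theta t}M(\theta) \le \exp(-\theta t + H(\theta)).
\]

As the inequality holds for all $\theta \ge 0$, it holds for the infimum over $\theta \ge 0$, proving (5).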

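To demonstrate the left tail bound let $\theta < 0$. Since $Y^s \ge Y$ and $\theta < 0$, using (11), (10) and that $Y$ is a function of $K$ we obtain

\[
E\left(e^{\theta Y} - e^{\theta Y^s}\right) \le \frac{|\theta|}{2}E\left((e^{\theta Y} + e^{\theta Y^s})(Y^s - Y)\right) \le |\theta|E\left(e^{\theta Y}(Y^s - Y)\right) = |\theta|E\left(e^{\theta Y}E(Y^s - Y \mid K)\right). \tag{20}
\]

By (9) we have

\[
E(Y^s - Y \mid K) = \frac{1}{n}\sum_{v \in \mathcal{V}}\left(d_1(v) + 1(d(v) \ne 0)\right) \le \frac{1}{n}\sum_{v \in \mathcal{V}} d_1(v) + 1, \tag{21}
\]

and noting that

\[
\sum_{v \in \mathcal{V}} d_1(v) = \sum_{v \in \mathcal{V}}\sum_{w \in N(v)} 1(d(w) = 1) = \sum_{w \in \mathcal{V}}\sum_{v \in N(w)} 1(d(w) = 1) = \sum_{w \in \mathcal{V}} d(w)1(d(w) = 1) = \sum_{w \in \mathcal{V}} 1(d(w) = 1),
\]

the number of degree one vertices in $K$, which is at most $n$, by (21) we find that $E(Y^s - Y \mid K) \le 2$. Now, by (20) and (21),

\[
E\left(e^{\theta Y} - e^{\theta Y^s}\right) \le 2|\theta|E(e^{\theta Y}),
\]

and therefore, justifying differentiating under the expectation as before, applying (8) yields

\[
m'(\theta) = E(Ye^{\theta Y}) = \mu E(e^{\theta Y^s}) \ge \mu(1 + 2\theta)m(\theta).
\]

Again with $M(\theta)$ as in (19),

\[
\begin{aligned}
M'(\theta) &= \frac{1}{\sigma}e^{-\theta\mu/\sigma}m'(\theta/\sigma) - \frac{\mu}{\sigma}e^{-\theta\mu/\sigma}m(\theta/\sigma) \\
&\ge \frac{\mu}{\sigma}e^{-\theta\mu/\sigma}\left(1 + \frac{2\theta}{\sigma}\right)m(\theta/\sigma) - \frac{\mu}{\sigma}e^{-\theta\mu/\sigma}m(\theta/\sigma) \\
&= \frac{2\mu\theta}{\sigma^2}M(\theta).
\end{aligned}
\]

Dividing by $M(\theta)$, integrating over $[\theta, 0]$ and exponentiating yields

\[
M(\theta) \le \exp\left(\frac{\mu\theta^2}{\sigma^2}\right), \tag{22}
\]

showing the claimed bound on $M(\theta)$ for $\theta < 0$. The inequality in (22) implies that for all $t > 0$ and $\theta < 0$,

\[
P\left(\frac{Y-\mu}{\sigma} \le -t\right) \le \exp\left(\theta t + \frac{\mu\theta^2}{\sigma^2}\right).
\]

Taking $\theta = -t\sigma^2/(2\mu)$ we obtain (6). □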
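For small $n$ the left tail bound (6) can be compared with the exact distribution of $Y$. The sketch below is an illustration under assumed parameters ($n = 5$, $p = 0.3$) with hypothetical function names; it computes the exact law of $Y$ by enumerating all graphs and tabulates the exact left tail next to $\exp(-t^2\sigma^2/(4\mu))$.

```python
import math
from itertools import combinations, product

def isolated_pmf(n, p):
    """Exact pmf of the number of isolated vertices, by enumerating all graphs (small n only)."""
    edges = list(combinations(range(n), 2))
    pmf = [0.0] * (n + 1)
    for present in product((0, 1), repeat=len(edges)):
        deg = [0] * n
        prob = 1.0
        for (u, v), x in zip(edges, present):
            prob *= p if x else 1.0 - p
            if x:
                deg[u] += 1
                deg[v] += 1
        pmf[deg.count(0)] += prob
    return pmf

def left_tail_table(n, p, ts):
    """Rows (t, exact P((Y - mu)/sigma <= -t), bound from (6))."""
    pmf = isolated_pmf(n, p)
    mu = sum(k * w for k, w in enumerate(pmf))
    var = sum(k * k * w for k, w in enumerate(pmf)) - mu * mu
    sigma = math.sqrt(var)
    rows = []
    for t in ts:
        exact = sum(w for k, w in enumerate(pmf) if (k - mu) / sigma <= -t)
        bound = math.exp(-t * t * var / (4.0 * mu))
        rows.append((t, exact, bound))
    return rows

rows = left_tail_table(5, 0.3, [0.25, 0.5, 0.75, 1.0, 1.5, 2.0])
for t, exact, bound in rows:
    print(t, exact, bound)
```

Since $Y \ge 0$, the exact left tail vanishes once $t > \mu/\sigma$, while the bound remains positive; the comparison is therefore most informative for moderate $t$.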